The AT&t German text-to-speech system: realistic linguistic description

نویسندگان

  • Matthias Jilka
  • Ann K. Syrdal
چکیده

Like many current TTS systems the AT&T German text -tospeech system is based on the methods of unit selection and concatenative synthesis [1]. This paper highlights efforts to improve TTS quality by closely matching the speakers' original productions with linguistic descriptions. On the segmental level this is achieved by adjusting the speakers' individual productions to an established, general norm via strict monitoring and correspondingly by having the linguistic representations that control automatic alignment and TTS output, i.e. the recognition dictionary and letter-to-sound rules, reflect those original productions. The chosen standard represents a realistic form of spoken German, avoiding overly formal pronunciations. A perceptual comparison with a more traditional interpretation of German pronunciation demonstrates the positive effect of these measures on overall synthesis quality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Phonetic Transcription of Non − Prompted Speech

Automatic Segmentation" (MAUS) system labels and segments the phonetic constituents of spoken German in a manner similar to highly trained phoneticians. MAUS has been used to train automatic speech recognition (ASR) systems as well as to provide detailed statistical analyses of spontaneous speech (using the Verbmobil I and RVG I corpora). The MAUS system is a reliable, automatic means of testin...

متن کامل

Issues In Text-To-Speech For French

This paper reports the progress of the French text-to-speech system being developed at AT&T Bell Laboratories as part of a larger project for multilingual text-to-speech systems, including languages such as Spanish, Italian, German, Rus-sian, and Chinese. These systems, based on di-phone and triphone concatenation, follow the general framework of the Bell Laboratories English TTS system [?], [?...

متن کامل

Rule-based Prosody Prediction for German Text-to-Speech Synthesis

This paper presents two empirical studies that examine the influence of different linguistic aspects on prosody in German. First, we analysed a German corpus with respect to the effect of syntax and information status on prosody. Second, we conducted a listening test which investigated the prosodic realisation of constituents in the German ’Vorfeld’ depending on their information status. The re...

متن کامل

The bell labs German text-to-speech system: an overview

In this paper we present an overview of the German version of the Bell Labs text-to-speech system, a high-quality concatenative synthesis system with extensive text analysis capabilities. We discuss problems of text analysis, and our solutions to these problems, including: the integration of text normalization tasks into linguistic text analysis; the capability to morphologically analyze compou...

متن کامل

Linguistic Means of Description of Family Relations in the Novel “In Chancery” By J. Galsworthy

The article is devoted to the study of the evaluative component of the meaning of lexical means used to describe relations between family members in the novel “In Chancery” by J. Galsworthy. The relevance of t &he study can be attributed to the lack of works devoted to this problem. As the results of our study demonstrate, the words of the lexical-semantic group “family” were mainly used to ver...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002